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Amendments to the Specification: 

Please amend the title on the front page of the application as follows: " Method and 
Apparatus for Predication Using Micro Micro-Operation Generator for Deriving a Plurality of 
Single-Destination Micro-Operations from a Given Predicated Instruction ". 

Please amend the title preceding paragraph [0001] at page 1 of the published application 
as follows: " M e thod and Apparatus for Pr e dication Using Micro Ops Micro-Operation 
Generator for Deriving a Plurality of Single-Destination Micro-Operations from a Given 
Predicated Instruction ". 

Please amend paragraph 7 as indicated: 

[00036] Efficiency of the renaming mechanism in an out-of-order processor [[my]] may 
drive processor performance. That is, the renaming mechanism and its associated physical 
register file may represent critical resources for an out-of-order processor architecture. 
Implementation of predication on such processors poses interesting issues. 

Please amend paragraph 1 1 as indicated: 

[00011] FigsFig.. 3 is a flow diagram illustrating at least one embodiment of a generalized 
execution pipeline for an out-of-order processor. 

Please amend paragraph 23 as indicated: 

[00023] The resultant micro-operations 175a 175n 172a-172n , including the standard micro- 
ops and any additional predication support micro-ops, each specify a single destination register, 
even if the original instruction 120 indicates multiple destination registers. It is the resultant 
micro-operations 175a 175n 172a- 172n that are executed, rather than the predicated instruction 
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120, by execution units 175. Of course, ellipses in Fig. 1 indicate that additional operations 
(such as, for example, register renaming) may be performed on the resultant micro-operations 
175a 175n 172a-172n before execution by the execution units 175. 

Please amend paragraph 25 as indicated: 

[00025] Fig. 2 illustrates additional micro-architectural details for at least one embodiment 
of a processor 101a. Processor 101a includes physical registers 104 and a rename map table 102. 
The processor 101 also includes an out-of-order ("OOO") register rename unit 106. The OOO 
rename unit 106, map table 404 -1 02 and physical registers 104 are all utilized for the purpose of 
OOO register renaming. 

Please amend paragraph 32 as follows: 

[00032] Brief reference to Fig. 3 illustrates an embodiment of the execution pipeline 300 
mentioned above. Fig. 3 illustrates that generating micro-ops during decomposition of an 
instruction may occur during a (i-op generation phase [[31 10]]310 of an execution pipeline 300. 
The illustrative pipeline 300 illustrated in Fig. 3 includes the following stages: instruction 
pointer generation 302, instruction fetch 304, instruction decode 306, architectural register 
rename 308, |n-op generation 310, physical register rename 311, dispatch 3 12, execution 313, 
and instruction retirement 314. The pipeline 300 illustrated in Fig. 3 is illustrative only; the 
techniques described herein may be used on any processor. For an embodiment in which the 
processor utilizes an execution pipeline 300, the stages of a pipeline 300 may appear in different 
order than that depicted in Fig. 3. 
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Please amend paragraph 35 as follows: 

[00035] The second instruction, instruction b, is a predicated instruction indicating a single 
destination register, rl . If the predicate value in qp is true, the add operation indicated by 
instruction b is net to be performed. In such case, a "second" result value of rl is generated. 
(Again, the value may actually reside in a physical rename register such one of the physical 
register 104s illustrated in Fig. 2). However, if the predicate value in qp is false, then the add 
operation indicated by instruction b will not be performed. In such case, the value of rl after 
execution of statement b, when qp is false, is the "first" value of rl. Essentially, then, instruction 
b is a select operation. 

Please amend paragraph 37 as indicated: 

[00037] Fig. 5 illustrates that, in order to simplify the hardware that executes a predicated 
instruction having more than two source operands, a sequence of three micro-operations 502, 
501, 506 503, 504 may be generated for instruction b (500). The following sequence of micro- 
operations may be produced by the (i-op generator 1 16 (Figs. 1, 2) for sample instruction b set 
forth above. That is, predicated instruction b may be decomposed by (i-op generator 1 16 into 
the following multiple micro-ops, where each micro-op has only two sources and only one 
destination: 

502. append temp = rl, qp 

503. add rl=r2,r3 

504. cmov rl = temp, rl 
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Please amend paragraphs 39-41 as follows: 

[00039] Micro-op 504503 , the add operation, is essentially a normal non-predicated add 
micro-op. It indicates only one destination register and , at most, two source registers. This type 
of instruction is referred to herein as a "standard" micro-op. A standard micro-op addresses the 
basic operation indicated by the instruction 120 (such as instructions 1-5 set forth in Table 1), 
without consideration of predication. 

[00040J Micro-op 506 504 ., the conditional move operation, is a predication-support micro-op 
that effects a selection between "first" and "second" values of rl . The conditional move 
("cmov") micro-op 506504 uses the extra bit appended to temp, which represents the qp value 
for the original predicated instruction 500, to select between the data value of temp (the original, 
or "first," value for rl) and the new, or "second," value of rl computed by the add instruction 
50 4503 . If the appended bit is true, then the predicate register for the original instruction 500 
indicated a "true" value. In such case, the new "second" value of rl is copied over itself into rl. 
If, however, the appended bit is false, then the predicate register for the original instruction 500 
indicated a "false" value. In such case, the original "first" value of rl is moved from temp back 
into rl. 

[00041] Fig. 5 illustrates that the hardware 520 to execute the cmov instruction 506504is 
much simpler than the hardware 400 for execution of instruction b (500) that is illustrated in Fig. 
4. After decomposition, simplified execution hardware 520 for the cmov instruction 506504only 
receives two inputs: temp and rl . 
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